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Abstract. The Gibbs distribution for n particles of the Coulomb gas on the 
unit circle at inverse temperature j3 is given by 

KU) = ^— /•■■/ /(e i9 V..,e i9 ")|A(e^, 

for any symmetric function /, where A denotes the 
and Z n g the normalization constant. 

We will describe an ensemble of (sparse) random 
follow this distribution. Our approach combines 
of orthogonal polynomials on the unit circle with 
Dumitriu and Edelman. In particular, we resolve a 
find a tri-diagonal model for the Jacobi ensemble. 



matrices whose eigenvalues 
elements from the theory 
ideas from recent work of 
question left open by them: 



1. Introduction 

In 1962, Dyson [H] introduced three ensembles of random unitary matrices with 
a view to simplifying the study of energy level behavior in complex quantum sys- 
tems. Earlier work in this direction, pioneered by Wigner, focused on ensembles of 
Hermitian matrices. 

The simplest of these three models is the unitary ensemble, which is just the 
group U(n) ofnxn unitary matrices together with its Haar measure. The induced 
probability measure on the eigenvalues is given by the Weyl integration formula (cf. 
[171 §VII.4]): for any symmetric function of the eigenvalues, 
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(1-1) E(/) = ~r / ••• / fie Wl 



•)|A(e* 



, \2d9i d6 n 



where A denotes the Vandermonde determinant, 



(1.2) 



A (21, . . . , Zn) = Yl ( Zk 

l<j<k<n 



The orthogonal ensemble consists of symmetric n x n unitary matrices together 
with the unique measure that is invariant under U 1— > W T UW for all W € U(n). 
Alternatively, if U is chosen according to the unitary ensemble, then U T U is dis- 
tributed as a random element from the orthogonal ensemble. The distribution of 
eigenvalues is given by (|l.l(l but with |A| 2 replaced by |A| and a new normalization 
constant. 
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The symplectic ensemble is a little more complicated. Let Z denote the 2n x 2n 
block diagonal matrix 



(1.3) 



1 

-1 



1 

-1 



and define the dual of a matrix by U — Z T U T Z. The symplectic ensemble consists 
of self-dual unitary 2n x 2n matrices; the measure is that induced from U(2n) by 
the map U i— > U R U. (This is the unique measure invariant under V i— > W R VW for 
all W S U(2n).) The eigenvalues of such matrices are doubly degenerate and the 
pairs are distributed on the circle as in (|l.lfl but now with |A| 2 replaced by |A| 4 . 
Again, the normalization constant needs to be changed. 

Dyson also observed that these eigenvalue distributions correspond to the Gibbs 
distribution for the classical Coulomb gas on the circle at three different tempera- 
tures. Let us elaborate. 

Consider n identically charged particles confined to move on the unit circle in 
the complex plane. Each interacts with the others through the usual Coulomb 



potential, — log \ zi 



which gives rise to the Hamiltonian 



H(zi, ...,Zn) 



E 

l<j<k<n 



\0g\Zj 



(One may add a kinetic energy term; however, as we are interested only in the 
distribution of the particle positions, it has no effect.) This gives rise to the Gibbs 
measure (with parameters n, the number of particles, and f3, the inverse tempera- 
ture) 



(1-4) K(f) 
(1.5) 
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f(e^ , . . . , e"» ) e ~^ ei d0 1 ■ ■ ■ d0 
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(2n) n Z n ,p 

for any symmetric function /. The partition function is given by 



e i6 ")f d9i---d9 n 



(1.6) 



r(|/fa + i) 



as conjectured by Dyson. This was proved by Gunson and Wilson [T^], though 
the Good proof JOI is even better. We give another proof at the end of Section 

From the discussion above, we see that the orthogonal, unitary, and symplectic 
ensembles correspond to the Coulomb gas at inverse temperatures = 1, 2, and 4. 

Viewed from the opposite perspective, one may say that Dyson provided matrix 
models for the Coulomb gas at three different temperatures. Our first goal here is 
to present a family of matrix models for all temperatures. These matrices will be 
sparse — approximately 4n non-zero entries — which suggests certain computational 
advantages. To state the theorem, we need the following 
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Definition 1.1. We say that a complex random variable, X, with values in the 
unit disk, D, is 0„-distributed (for v > 1) if 

(1.7) E{f(X)} = ^ ff f(z)(l - \z\ 2 )^' 2 d 2 z. 

J Jo 

For v > 2 an integer, this has the following geometric interpretation: If v is chosen 
from the unit sphere S v in M l ' +1 at random according to the usual surface measure, 
then vi + 1V2 is 8^-distributed. (See Corollary I A. 31 1 

As a continuation of this geometric picture, we shall say that X is 0!-distributed 
if it is uniformly distributed on the unit circle in C. 

Let us now describe the family of matrix models. 

Theorem 1. Given (3 > ; let au ~ 0/3(n-fc-i)+i be independent random variables 
for < k < n — 1, Pk = yl — \ctk\ 2 , and define 

" |_Pfc -«fe_ 

/or < k < n — 2, while S_i = [1] and S„_i = are lxl matrices. From 

these, form the n x n block- diagonal matrices 

L = diag(S ,S 2 ,S 4 , . . .) and M = diag(H_i, H 1( 3 3 , . . . ). 

Both LM and ML give (sparse) matrix models for the Coulomb gas at inverse 
temperature (3. That is, their eigenvalues are distributed according to ()1.5fl . 

Remark. As each of the is unitary, so are L and M. (In the case of S n _i, we 
should reiterate that a n -± ~ 0i is uniformly distributed on the unit circle.) As a 
result, the eigenvalues of LM and ML lie on the unit circle. Note also that, since 
M conjugates one to the other, LM and ML have the same eigenvalues. 

In proving this theorem, we will be following the recent paper of Dumitriu and 
Edelman |S] rather closely, while incorporating the nuances of the theory of poly- 
nomials orthogonal on the unit circle. The matrices L and M that appear in the 
theorem have their origin in the work of Cantero, Moral, and Velasquez 3 ; this is 
discussed in Sectional 

Dumitriu and Edelman constructed tri-diagonal matrix models for two of the 
three standard examples of the Coulomb gas on the real line. A model for the third 
will be constructed below. 

The simplest way to obtain a normalizable Gibbs measure on the real line is to 
add an external harmonic potential V(x) = \x 2 . This gives rise to the probability 
measure 

(1.8) E(/)oc [■■■[ f(x 1 ,...,x n )\A(x 1 ,...,x n )fl[e- v ^dx 1 ---dx n 

J J . 

on R ra . This is known as the Hermite ensemble, because of its intimate connection 
to the orthogonal polynomials of the same name, and when /3 = 1, 2, or 4, arises as 
the eigenvalue distribution in the classical Gaussian ensembles of random matrix 
theory. Dumitriu and Edelman showed that l|1.8|) is the distribution of eigenvalues 
for a symmetric tri-diagonal matrix with independent entries (modulo symmetry). 
The diagonal entries have standard Gaussian distribution and the lower diagonal 
entries are 2 -1 / 2 times a %-distributed random variable with the number of degrees 
of freedom equal to j3 times the number of the row. 
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The second example treated by Dumitriu and Edclman is the Laguerre ensemble. 
In statistical circles, this is known as the Wishart ensemble, special cases of which 
arise in the empirical determination of the covariance matrix of a multivariate 
Gaussian distribution. For this ensemble, one needs to modify the distribution 
given in (|1.8|) in two ways: each particle Xj is confined to lie in [0, oo) and is subject 
to the external potential V(x) — — alog(a;) + x, where a > —1 is a parameter. In 
PI, it is shown that if B is a certain n x n matrix with independent ^-distributed 
entries on the main and sub-diagonal (the number of degrees of freedom depends on 
a, (3, and the element in question) and zeros everywhere else, then the eigenvalues 
of L = BB T follow this distribution. 

The third canonical form of the Coulomb gas on ffi. is the Jacobi ensemble. The 
distribution is as in 1(1. 8|) . but now the particles arc confined to lie within [—2,2] 
and are subject to the external potential V{x) = — alog(2 — x) — 61og(2 + x), where 
a.b > —1 are parameters. This corresponds to the probability measure on [—2, 2] n 
that is proportional to 

(1.9) | A(xi, . . . , x n )f Y[ (2 - X] ) a {2 + Xj ) h dx x --- dx n . 

j 

The partition function (or normalization coefficient) was determined by Selberg |14j . 
This will be discussed in Sectional 

Dumitriu and Edclman did not give a matrix model for this ensemble, listing it 
as an open problem. We present a tri-diagonal matrix model in Theorem [21 below. 
The independent parameters follow a beta distribution: 

Definition 1.2. A real- valued random variable X is said to be beta-distributed 
with parameters s,t > 0, which we denote by X ~ B(s, t), if 

(1.10) E{/(X)} = ^g^lp J 1 f( x )(l - xy-'il + xf- 1 dx. 

Note that B(^,^) is the distribution of the first component of a random vector 
from the ^-sphere. (See Corollary IA. 31 ) 



Theorem 2. Given (3 > 0, let ak, < k < 2n — 2, be distributed as follows 
(1.11) a k 



B( 2n-k-2 p + a+1; 2n-k-2 p + b + ^ k eveU; 
B ( 2n-k-3 p + a + b + 2l 2n -^- 1 /3) k Odd. 



4 

Let ct2 n -i = ot—i = — 1 and define 

(1-12) b k +i = (1 - a 2 k-i)ot2k - (1 + a 2 k-i)a2k-2 

1 /2 

(1-13) a k +i = {(1 - a 2 fc-i)(l - a 2fc )(l + a 2 k+i)} 

for < k < n — 1; then the eigenvalues of the tri-diagonal matrix 



a n -i b n 

are distributed according to the the Jacobi ensemble 1)1.9(1 . 



ai b 
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Wc know of two other papers which discuss the Jacobi ensemble in a manner 
inspired by the work of Dumitriu-Edclman: J3 §4.2] and These results are 

however of a rather different character; in particular, we contend that Theorem [21 
is the true Jacobi ensemble analogue of the results of 5 . 

In Section |B1 we show how the ideas developed in the earlier parts of this paper 
lead to new derivations of the classical integrals of Aomoto and Selberg |14| . 
The main novelty of these proofs is their directness: They treat all values of (3 and 
n on an equal footing. In particular, we do not prove the result for (3 an integer 
and then make recourse to Carlson's Theorem. These remarks are also applicable 
to the proof of (|1.6() given at the end of Section 01 

Acknowledgements: We would like to thank Barry Simon for encouragement and 
for access to preliminary drafts of his forthcoming book . 

2. Overview of the Proofs and Background Material 

We begin by examining the (3 — 2 case of Theorem ^ that is, Haar measure on 
the unitary group. 

Rather than study the eigenvalues as the fundamental statistical object, we will 
consider the spectral measure associated to U and the vector e\ = (1, 0, . . . , 0) T . It 
will be denoted by dfi. As Haar measure is invariant under conjugation, any choice 
of unit vector e\ leads to the same probability distribution on dfi. 

The most natural coordinates for dfi are the eigenvalues e 101 , . . . , e ™ and the 
mass that dfi gives to them: fi\ = fi({e l61 }), . . . ,fi n -i — ^{{e 16 ™- 1 }). As / dfi = 1, 
we omit fi n = fi({e ie ™}). Note that we have chosen not to order the eigenvalues, 
which means that the natural parameter space gives an n!-fold cover of the set of 
measures. We have already used this way of thinking a number of times, beginning 
with H 1 . 1 p . 

The above system of coordinates does not cover the possibility that U has de- 
generate eigenvalues. However, as the Weyl integration formula shows, the set of 
such U has zero Haar measure; in fact, the density vanishes quadratically at these 
points. The reason for this is worth repeating (cf. |17l §VII.4]): The submanifold 
where two eigenvalues coincide has co-dimension three in U(n); one degree of free- 
dom is lost from the reduction of the number of eigenvalues and two more are lost 
in the reduction from two orthogonal one-dimensional eigenspaces to a single two- 
dimensional eigenspace. One should compare this to spherical polar coordinates 
in R 3 where r = is a submanifold of co-dimension three and consequently, the 
density also vanishes to second order. 

In Section |31 we will determine the probability distribution on dfi induced from 
Haar measure on U(n), in the 9,fi coordinates. Conjugation invariance of Haar 
measure implies that the eigenvalues and masses are statistically independent; it 
is then easy to see that the former are distributed as in (|1 . 1|> and (/ii, . . . , fi n ) is 
uniformly distributed on the simplex ^2 A*j = 1- See Proposition 13 . 11 This implies 
that dfi gives non-zero weight to each of the eigenvalues with probability one. As 
a consequence, we can always recover the eigenvalues from dfi. 

We will now introduce different coordinates, (ao, ■ • ■ , a n _i), for dfi that arise in 
the study of orthogonal polynomials on the unit circle. 

The monomials l,z, . . . , z n ~ x form a basis for L 2 (dfi) and so, applying the Gram- 
Schmidt procedure, we can construct an orthogonal basis of monic polynomials: <!>,,, 
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< j < n, with $j monic of degree j. We also define (f>j = &j/\\&j\\, which gives 
an orthonormal basis. 

There is a well-developed theory of such orthogonal polynomials, parts of which 
will be important in what follows. For a proper discussion of this theory, see 
HU, or [H3 Ch. XI]. 

The first important fact about the orthogonal polynomials is that they obey 
recurrence relations: 

(2.1) 3> k+1 (z)=z$ k {z)-a k $* k (z) 

(2.2) = **(*) 

where the a k arc the recurrence coefficients and denotes the reversed polynomial: 



k 

£« 

i=o z=o 



(2.3) $ fe (^)-^ c/ z' =► $* k {z)= ) n -r. 



Equivalently, $* k {z) — z <& k {z 1 ). These recurrence equations imply 

k-i 

(2-4) IWL^) = 11^ where P/ = V / l _r KF I 

1=0 

from which the recurrence relations for the orthonormal polynomials are easily 
derived. 

The recurrence coefficients a k have been called by many names; we will fol- 
low |15| . where they were recently dubbed 'Verblunsky parameters'. Each of 
ao, . . . , a n -2 lies inside the unit disk, B, while lies on its boundary, S 1 . 

There is an alternate way of relating measures to their Verblunsky parameters, 
namely the Schur algorithm: If dfi is a probability measure, then we define its Schur 
function /:D -> D by 

(2.5) /(z) = I£|f|_l where F(z) = j '^±£^( e «). 

The Schur algorithm parameterizes analytic maps / : H> — > D by finitely or infinitely 
many parameters a k . 

There are finitely many parameters if and only if / is a finite Blaschke product, 
or, equivalently, iff dfi has finite support. More precisely, the support of dfi consists 
of n points iff / is a Blaschke product of degree n. This is the case when there 
are n Verblunsky parameters: cto, . . . , a„_2 € © and a„_i <E S . When there are 
finitely many parameters, the last must always be unimodular. In fact, the final 
parameter is essentially equal to the product of the locations of the mass points of 
dfi; see (|B.4|I . 

When dfi has infinite support, there are infinitely many Verblunsky parameters, 
all of which lie in the unit disk. 

Just as the Schur algorithm gives a bijection, so there is a bijection between mea- 
sures dfi on S 1 supported at n points and sequences of parameters ao, . . . , a n -2 £ D, 
Q! n _i € S . This justifies their use as coordinates for the measure dfi. 

In Proposition ^. 31 we determine the probability distribution on dfi (induced by 
Haar measure on U(n)) in these new coordinates. Interestingly, the as turn out to 
be statistically independent, with a k ~ &2n-2k-i- 

It is now but a few short steps to the (3 = 2 case of Theorem ^ 
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Consider the operator f(z) i— ► z/(z) in L 2 (dfi). The spectral measure associated 
to the vector f(z) = 1 is simply d/i. To obtain a matrix model, we need only choose 
a basis in which to represent this operator. The most obvious choice is the basis 
of orthonormal polynomials, {4>k}- This leads to a matrix whose entries can be 
expressed simply in terms of the as. However, this matrix is not sparse: all entries 
above and including the sub-diagonal are non-zero (with probability one). Such 
matrices are typically known as being in Hessenberg form. In deference to this, we 
will denote the matrix by H. It plays an important role in the determination of 
the distribution of the Verblunsky parameters, but does not appear in Theorem ^ 

The matrix LM described in Theorem ^ is f( z ) l— * z fi z ) m L 2 (d[i) in the 
orthonormal basis formed from l,z, z by applying the Gram-Schmidt pro- 
cedure. That this matrix can be expressed so simply in terms of the Verblunsky 
coefficients is a discovery of Cantero, Moral, and Velazquez, 0. Related matters 
are discussed in Appendix [5] (The matrix ML is the same operator in the basis 
formed by applying Gram-Schmidt to 1, z~ , z, ) 

Thus far, we have only spoken about the unitary group, that is, about = 2. 
In this case, we have found a random ensemble of measures whose mass points are 
distributed as the particles in the Coulomb gas at inverse temperature (3 — 2. The 
key discovery, however, was that the corresponding Verblunsky parameters turn 
out to be independent. 

For more general 0, we wish to find an ensemble of measures so that the mass 
points are distributed appropriately; we have complete freedom in choosing how 
the weights are distributed. By the same token, we want the induced probability 
distribution on the Verblunsky parameters to retain independence. We can then 
form the matrix set out in Theorem ^a-nd its eigenvalues are guaranteed to follow 
the proper distribution. 

The key to satisfying these desires is Lemma 14.11 It expresses the value of 
the Toeplitz determinant associated to d\x in terms of the 6, fi coordinates and 
in terms of the Verblunsky parameters. Multiplying the probability distribution 
from the (5 = 2 case by the appropriate power of the Toeplitz determinant gives 
Proposition ^. 21 which is exactly the resolution of the goals set forth in the previous 
paragraph. 

As an off-shoot of proving Theorem ^ we are able to determine the Jacobian 
for the map from the (9, /^-coordinates to the Verblunsky parameters Oft. That 
this is possible is a delightful idea of Dumitriu and Edelman 5 . (See Lemmas 14.31 
andIO) 

Were we granted the Jacobian for this map, the paper could be much shorter — 
though we contend that the scenic route followed below is not without merit. It 
is a natural quantity to calculate and the answer takes a rather simple form. This 
behooves us to find a simple, direct derivation. Thus far, we have failed. We would 
be much obliged to any reader who can resolve this matter. 

The proof of Thcorcm[2]is very similar. Again we begin by studying the problem 
for (3 = 2. The relevant group in this instance is not U(n), but rather SO(2n). 
Such matrices have eigenvalues in complex conjugate pairs and the corresponding 
eigenvectors are complex conjugates of one another. Consequently, the spectral 
measure associated to e\ is symmetric with respect to complex conjugation. The 
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most natural coordinates are 9j G (0, ir) and /ij € [0, 1] where 

~ n 

(2.6) / f df ,=j2^[f(e ie n+f(e- ie n] 

i=i 

and = 1 • 

Once again, we use the Verblunsky coefficients as a second set of coordinates. 
These are now real as a consequence of the complex conjugation symmetry of the 
measure. Indeed, a measure has this symmetry if and only if its Verblunsky coef- 
ficients are real. From this and the foregoing discussion of the general case, we see 
that the last Verblunsky coefficient, a2n-ii must be real and unimodular. In fact 
it must be —1 because the product of the eigenvalues of a matrix from SO(2n) is 
equal to one; see (|B.4|) . The remaining Verblunsky coefficients a k , < k < 2n — 2, 
are free to range over (—1, 1). 

By proceeding very much as before, we can construct certain ensembles of orthog- 
onal matrices for which the spectral measure is distributed in a desirable fashion. 
When the Verblunsky coefficients are real, both det(l — U) and det(l + U) have 
simple expressions in terms of these coefficients. As a result, we are able to add 
two new parameters a and b to our family of distributions. This line of reasoning 
leads us to Proposition 15. 31 

Given a measure, d/i, on S 1 that is symmetric with respect to complex conjuga- 
tion, one may define a measure on [—2, 2] by 

(2.7) / f(z + z- 1 )dfi{z)= [ f{x)dv{x). 

JS 1 J-2 

In particular, if d\x is of the form (|2.(il) . then we find 

In this way, we find that Proposition l5.3l relates an ensemble of probability measures 
on [—2, 2] to a certain ensemble of Verblunsky coefficients. In fact, the locations of 
the masses of dv are distributed as the points in the Jacobi ensemble, l|1.9f) . and 
are independent of the masses. 

Theorem follows immediately from the fact that the matrix J represents 
f(x) i— ► xf(x) in L 2 (dv) with respect to the basis of orthonormal polynomials. 
The remainder of this section is devoted to explaining the origin of this fact. 

Let P k (x) denote the monic polynomials orthogonal with respect to dv andpfc(ir), 
the corresponding orthonormal polynomials. These obey a three-term recurrence 
relation §3-2]: 

(2.8) xp k (x) = a k+ ip k+l {x) + b k+1 p k (x) + a k p k -i(x), 

which explains the structure of the matrix J. 

It is a famous observation of Szego (see ^| §11.5]) that the polynomials orthog- 
onal with respect to d/i are intimately related to those orthogonal with respect to 
dv. Specifically, 
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or equivalently, 

, , - X \ z ~ k hk(z) + z k (t) 2k (z- 1 ) 

r>h (z + z = ■ . 

Attendant to this relation between the orthogonal polynomials is a relation between 
their recurrence coefficients: 

h+i = (1 - ot 2 k-i)ot2k - (1 + a2fc-i)a2fc-2 

ak+i = {(1 - Q!2fe-i)(l - a 2 2k ){l + a 2 fc+i)} 1/2 - 

These equations are known as the Geronimus relations. They are exactly equations 
(I1.12fl and i|1.13fl from the statement of Theorem [3 This shows that the matrix J 
defined in that theorem really does represent multiplication by x in L 2 {dv). 

The connection between the orthogonal polynomials for d\x and dv suggests a 
relation between the matrices LM of Theorem ^ an d J of Theorem |3 This is 
investigated in Appendix [El in particular we are able to give a short proof of the 
Geronimus relations. 



3. Distribution of dfj. for U(n) and SO(2n) 

Let ex denote the standard unit vector (1,0,..., 0) T . As described in the Intro- 
duction, for each U € U(n) we consider the spectral measure associated to the pair 
(U, ei), that is, the unique measure on S 1 — {z £ C : \z\ = 1} that obeys 




for all n e Z. The first goal of this section is to determine the probability distri- 
bution of d[i when U € U(n) is chosen according to Haar measure. We will then 
prove the analogous result for SO(2n). In each case, we give the distribution both 
in terms of the natural parameters (the eigenvalues and associated masses) and in 
terms of the Verblunsky parameters. 

Proposition 3.1. If U is chosen according to Haar measure on U(n), then the 
probability measure on dfj, is given by 

(3.1) (T [/o Vn l A ( ei91 ' • • • ' e^")| 2 ^i • • • de n dfil... dn n -x 

n\{2ir) n 1 1 

where < 9j < 2n and < fij < 1 with ^ fij < 1 . 

Proof. Conjugation invariancc of Haar measure on U(n) implies that the distribu- 
tion of the eigenvectors is independent of that of the eigenvalues. The distribution 
of the eigenvalues is given by the Weyl integration formula (|l.l(l . while the masses 
are simply the square moduli of the top entries of the normalized eigenvectors. 

Of course, the normalized eigenvectors are only determined up to a phase factor. 
If we choose this phase factor at random from the unit circle, then by conjugation 
invariance, the top entries of the eigenvectors are distributed as a random unit 
vector from C n . Therefore, by Corollary IA. 31 (pi, . . . ,fi n ) is uniformly distributed 
on the (n — l)-simplex ^2 f-j = 1- ^ 

The key to writing the probability measure on dfi in the Verblunsky coordinates 
is the Householder algorithm |12l §6.4], which converts any matrix to one in Hes- 
senberg form (i.e., with zeros below the sub-diagonal) via unitary conjugation. The 
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algorithm proceeds iteratively, killing the undesirable elements in each column in 
turn, working left to right. 

Consider a matrix A whose first k — 1 columns are zero below the sub-diagonal. 
Let us write A k — [fti.fc, ■ ■ • , o-n,k\ T for the fcth column of A and then define 

(3.2) v — [0, . . . , 0, a, a k +2,k, ■ ■ ■ , a n ,k] T 

(3.3) a = a k+ i. k - ak+1 - k J\a k+lk \ 2 H h |a„ !fc | 2 . 

|afc+i,fe| v 

The reflection through the plane perpendicular to v is given by R = I -25r. ft 
maps A k to A k — v, which has zeros in the desired places. Moreover, the first k — 1 
columns of A are unchanged by left multiplication by R because they are orthogonal 
to v — indeed, their only non-zero entries coincide with zeros in v. Similarly, right 
multiplication of any matrix by R leaves its first k columns unchanged. 

As R is a reflection, R^ 1 = R = R' and so we find that conjugating A by R 
gives a new matrix whose first k columns agree with Hessenberg form (i.e., with 
zeros below the sub-diagonal). In this way, we have described one step of the usual 
Householder algorithm. However, we wish to add one further conjugation so as to 
make the entries on the sub-diagonal non-negative. To do this, we form DRARD^ 
where D differs from the identity matrix by having {k + 1, k + 1) entry e _l<6 with 
4> chosen appropriately. 

Starting with a unitary matrix U, we can apply the above algorithm to obtain a 
unitarily equivalent matrix, H, in Hessenberg form with non-negative sub-diagonal. 
Moreover, the spectral measure for (H, e\) is the same as that for (U, e\) — namely 
d/i — because the vector e\ is fixed by all the unitary matrices by which U is conju- 
gated. 

In the Introduction, we used H to denote the matrix representation of f(z) i— > 
zf(z) on L 2 (d^L) in the basis of orthonormal polynomials. It is easily seen to be 
in Hessenberg form and to have a non-negative (indeed positive) sub-diagonal. As 
one might hope from the notation, these two matrices are the same (see also ^ 
Corollary 3.3]): 

Lemma 3.2. Suppose H is an n x n unitary matrix in Hessenberg form with non- 
negative sub- diagonal and let dfi denote the spectral measure associated to the vector 
e\. If the support of dfi consists of n points, then H represents f{z) i— > zf(z) in 
the basis of orthonormal polynomials. Consequently, 

{-Oi-lOlj Ui=i Pi i < J + 1 
Pj-i i = j + 1 

i > j + 1 

where pj = yl ~ \ a j\ 2 an d a -i = ~ !• 

Proof. As dfx is the spectral measure for (H, e±) and as L 2 (d(x) has the same di- 
mension as the space on which the operator H acts, there must be an orthonormal 
basis /i, . . . , /„ for L 2 (d[i) with f = 1 such that H represents f(z) i— > zf(z) in 
this basis. This is just the spectral theorem combined with the fact that e\ must 
be cyclic (for otherwise, L 2 (d/i) wouldn't have full dimension). 

From the cyclicity argument we also learn that H must have a strictly positive 
sub-diagonal. 



MATRIX MODELS FOR CIRCULAR ENSEMBLES 



11 



To finish the proof of the first claim, we need only show that fj(z) = 4>j-i(z); 
that is, that the orthonormal basis in question is precisely that of the orthonor- 
mal polynomials. Because H is in Hessenberg form with positive sub-diagonal, 
the standard basis vectors arise from applying the Gram-Schmidt procedure to 
e±,Hei, . . . ,H n ~ 1 e\. Consequently, the vectors fj must be the result of applying 
the same procedure to l,z, . . . , z n \ that is, fj must be 4>j-i- 

The first part of l|3.4|l merely reexpresses what we have just proved. The second 
follows from the recursion relations and (|2.4|) : however, the proof is not particularly 
enlightening and we refer the reader to [S] for details. □ 



We will now apply the Householder algorithm outlined earlier to a matrix cho- 
sen at random from U(n). By the lemma above, this will allow us to determine 
the induced distribution on the Verblunsky parameters associated to the spectral 
measure for (U, ei). 

Proposition 3.3. Let d/i be the spectral measure for (U, e\) with U chosen at ran- 
dom from U(n) according to Haar measure. In terms of the Verblunsky parameters, 
ao, . . . ,a n -2, and ct n -\ = e 1 ^ , this probability distribution is given by 

n-2 

(3.5) |^ [] (! K| 2 )"~ 2 ~ fc d 2 a ■ ■ ■ d 2 a n _ 2 #. 

fc=0 

That is, the Verblunsky parameters are independent and ctj ~ 02 n _2j-i- 

Proof. The key to applying the Householder algorithm to a random element U G 
U(n) is the following realization of Haar measure: Choose the first column at 
random from the unit sphere; then choose the second column from the unit sphere 
of vectors orthogonal to the first; then the third column and so forth. In this way, 
one could say that the columns of U form a random orthonormal basis for C™. 
(That this is indeed Haar measure is a simple consequence of invariance under left 
multiplication by unitary matrices.) 

The first column of U is a random vector from the unit sphere. After left 
multiplication by the appropriate reflection R, the new first column takes the 
form [ao, b, 0, ... , 0] T where ao is the the original (1, 1) entry of U and so C^n-i- 
distributed, while b has modulus pa and arbitrary argument. Subsequent left mul- 
tiplication by D converts the first column to [ao,po,0, . . . ,0] T , as it will remain. 
The other columns are still orthogonal to the first; indeed, they form a random 
orthonormal basis for the orthogonal complement of the first column. 

Right multiplication by RD^ leaves the first column untouched while orthogo- 
nally intermixing the other columns. Of course, this means that they remain a 
random orthonormal basis for the orthogonal complement of the first column. (Re- 
member, Haar measure is also invariant under right multiplication by a unitary.) 

For the subsequent columns, the procedure is similar. Let us skip ahead to 
dealing with the A;th column. 
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From the unitarity of the matrix H from the previous lemma, 



P0P1P2 ■ ■ ■ Pk-2 
-a p 1 p 2 ■ ■ ■ Pk-2 
-a 1 p 2 ■ ■ ■ Pk-2 



X = 



— OLk-ZPk-2 
— OLh-2 











is a unit vector orthogonal to the first k — 1 columns. As the kth column is a 
random vector orthogonal to the first k — 1 columns, its inner product with X is 
distributed as the top entry of a random vector from the (2n — 2k + l)-sphere and 
is independent of a%, . . . ,a k ~2- Let us call this inner product dtk—i, noting that 
this implies a k -i is 02n-2fc+i-distributed as stated in the proposition. 

We now multiply the matrix at hand from the left by the appropriate reflection 
and rotation to bring the kth column into the desired form. Neither of these 
operations alters the top k rows and so the inner product of the kth column with 
X is unchanged. But now the kth column is uniquely determined; it must be 
a k -iX + p k -ie k +i, just as in 13. 4|) . 

Lastly, we should multiply on the right by RD\ but this leaves the first k 
columns unchanged while orthogonally intermixing the other columns. In this way, 
we obtain a matrix whose first k columns conform to the structure of H, while the 
remaining columns form a random basis for the orthogonal complement of the span 
of those k columns. 

In this way, we can proceed inductively until we reach the last column. It is 
obliged to be a random orthonormal basis for the one-dimensional space orthogonal 
to the preceding n—1 columns and hence a random unimodular multiple, say 
of X. This is why the last Verblunsky parameter is Oi-distributed. 

We have now conjugated U to a matrix in the form of i|3.4fl with parameters 
distributed as stated in the proposition. The vector e\ is unchanged under the 
action of each of the conjugating matrices and consequently, these are precisely the 
Verblunsky parameters of dp,. □ 

We now turn to the study of Haar measure on SO(2n). The proofs follow those 
given above pretty closely. 

Proposition 3.4. If U is chosen at random from SO(2n) according to Haar mea- 
sure, then the spectral measure dp associated to (U,ei) is distributed as 



Proof. By the Weyl integration formula for SO(2n), the marginal distribution of 
the eigenvalues is as above. (See |EJ §VII.9].) 

If the eigenvalues are prescribed, say e ±lSl , . . . , e ±l9 ", then the conditional dis- 
tribution of U is given by taking a fixed matrix with this spectrum and conjugating 



(3.6) 




where 9j and fij are the coordinates given in l|2.6|l . 
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it by a random clement from SO(2n). The natural choice for this fixed matrix is 
block diagonal: 



Uq = diag 



cos(0i) sin(#i) 
— sin(#i) cos(#i) 



cos(6>„) sin(0 n ) 
-sin(0„) cos(6>„) 



From this we see that the d\x is the spectral measure for Uq and a random vector 
from the (2n — l)-sphere. The proposition then follows by diagonalizing Uq and 
applying Corollary IA. 31 □ 

Proposition 3.5. Let U be chosen from SO(2n) according to Haar measure and let 
d/i denote the spectral measure for (U,e\). In terms of the Verblunsky parameters, 
the probability distribution on dfi is 

2n-2 

(3-7) ^ H(l- at)^ 1 dao--- da 2n -2 

k=0 

with «2n-i = — 1- That is, the Verblunsky parameters are independent and ctk ~ 

r>/2n— k— 1 2n — k— 1\ 
B\ g ' 2 >■ 

Proof. While we may use the Householder algorithm as set out above, the fact that 
we are now dealing with real-valued matrices allows the following simplification: 
the vector defining the reflection is as in l|3.2[l . but now with 



a - a k +i,k - \] a l + i, k H 1" a 2n,k 

instead of l|3.3[l . This permits us to forgo the conjugation by D. 

Haar measure on SO(2n) can be realized by choosing the first column as a 
random vector from the unit sphere in M 2n , and then the second as a random 
vector orthogonal to the first, and so forth. However, the fact that the matrix has 
determinant one means that the first 2n — 1 columns completely determine the last. 
One may say that the columns of U form a random positively oriented basis for 
R 2n . 

Proceeding as in the proof of Proposition ^. 31 we see that at-i is defined as the 
inner product of a specific vector X with a random unit vector from the (2n — k — 2)- 
sphere of vectors orthogonal to the first k— 1 columns. It follows from Corollarv lA.2l 
that — B( 2n ~2~ 2 , 2 "~ 2 fc ~ 2 ) as stated above. 

The last column of H, and hence a2 n -i, is uniquely determined by the fact that 
dct(iJ) = 1. It is just a matter of using (|3.4|) to determine which value of «2n-i 
makes this determinant one; moreover, by continuity of the determinant, it suffices 
to consider the case where all other as are zero. This gives 

1 = det(-ff) = -a_ia 2n -i sign(cr) = —a 2n -i 

where a is the cyclic permutation j : t— > j : + I mod 2n, which is odd. 

Lastly, we should justify the normalization coefficient given in l|3.7|l : what ap- 
pears there is very much simpler than one would expect from HI.IQfl . This sim- 
plification is based on the duplication formula for the T function: y/iTT(2t) = 
2 2 *- 1 r(<)r(i + |). Specifically, beginning with lfTT0|) . 



i r(<) 3 yfr(i]' 



14 



R. KILLIP AND I. NENCIU 



which causes the product of normalization coefficients to telescope: 

_„r(n) _ (n-1)! 

as given in 13.71) . 



n 

fc=0 



7T 2 



r(|) 



□ 



4. The Proof of Theorem 1 
Let dji be the measure on S 1 given by 



(4.1) 



P n 

I J<'i> »</"'* ) 

i=i 



with 9j € [0, 27r) distinct and 53 Mj = 1- As discussed in the Introduction, this 
measure is uniquely determined by its Verblunsky parameters ao, . . . ,a n -2 € D 
and q„_i = € S 1 . 

It is difficult to find functions of dfi that admit simple expressions in terms of 
both 0j, /Ltj and the Verblunsky parameters. One such quantity is the determinant 
of the associated Toeplitz matrix; this is the subject of the next lemma. 

Lemma 4.1. Ifdfi is a probability measure of the form given in (|4.1|) and ao, ■ ■ ■ , oin-1 
its Verblunsky coefficients, then 

n n— 2 

(4.2) | A(z 1; . . . , z„)| 2 J] N = J] (1 - lafcl 2 )"-*" 1 . 

3=1 fc=0 

Proof. Let = J^" =1 Mj^j denote the moments of d/i. We will prove that both 
sides of <|4.2[) are equal to the determinant of the n x n Toeplitz matrix associated 
to d/i: 

c c_i • • • ci_ 
ci c • • • C 2 - 



T = 

If we define A and Af by 
1 1 



Cn-l C„_2 



T 



21 



^2 



n-1 „n-l 

^1 



A/ = 



CO 



Mi 

M2 





then T = AM A" 1 . Consequently, 



det(T) = |det(A)| 2 det(M) = \A(z u z n )\ 2 Hj- 

3=1 

We will now show that the right-hand side of 14.211 is equal to det(T). To this 
end, let 

$0(21) $0(22) ■•• $o{z n ) 

$l(2l) $1(^2) ••• $l(Zn) 



B = 



$„_i(zi) $„_i(z 2 ) 
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which has the same determinant as A because each can be reduced to the other 
by elementary row operations. From the orthogonality property of the {^j}, it 
follows that BMB^ is the diagonal matrix whose entries are the squares of the 
L 2 (cfyi)-norms of $ , $],,... ,$„-i. Therefore by (|2.4I) . 

n— 1 n — 2 

dct(T) = det(BMBt) = J| \\<S> k \\ 2 L2W = Y[(1- W\ 2 ) n - k -\ 

k=0 k=0 

just as was required. □ 

Both expressions for the Toeplitz determinant are well known; indeed, the argu- 
ment presented above and its Hankel-matrix analog play a central role in random 
matrix theory. 

We are now ready to state and prove the main result of this section. Please note 
that neither measure given below is normalized; however, they do have the same 
normalization coefficient. It is calculated in Lemma T4.4I where it is used to give an 
independent proof of (|1.6(l . 

Proposition 4.2. The following formulae express the same measure on the mani- 
fold of probability distributions on S 1 supported at n points: 

(4.3) __|A(e*V..,e ie ")| /3 r[/' t / dQl ' ' " d9n dfil ' ' " d/X «" 1 

3=1 

in the {9, /x)- coordinates of l|4.1|l . and 

n-2 

(4.4) H (1 Kl 2 )^"-*- 1 )- 1 d 2 a ■ ■ ■ d 2 a n -2 dcj> 

k=0 

in terms of the Verblunsky parameters. 

Proof. When f3 = 2, this follows immediately from Propositions 13.11 and 13.31 To 
obtain the general-/? version of H4.3J) from the (3 = 2 version, one has to multiply 

by 



(4.5) |A(e*V..,e^r 2 n^ -1 > 

3=1 

while the same transformation of (3 in (|4.4I) is effected by multiplying by 

(4.6) n^-Ki 2 )^- 1 ^-^. 

fc=0 

But, H4.5f) and l|4.6|l are equal; they are either side of (|4.2|) raised to the power 
f-1. □ 

Proof of Theorem 1. Theorem ^ is an immediate corollary of Proposition 14.21 and 
results from [3]: The Verblunsky parameters of the spectral measure for (LM,e\) 
are precisely the as that appear in the definition of L and M . Consequently, if the 
Verblunsky parameters are distributed according to (|4.4|) . then the eigenvalues are 
distributed as in l|1.5l) . □ 
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It is fair to suggest that studying the unitary group is a rather roundabout way 
of proving the above proposition. We simply do not know a better way. The natural 
suggestion is to first calculate the Jacobian for the map from the (0, /^-coordinates 
to the Verblunsky parameters and to proceed from there. While we can determine 
this Jacobian a posteriori by employing a cunning idea of Dumitriu and Edelman, 
we do not have a direct derivation. 

The idea of Dumitriu and Edelman can be summarized as follows: 

Lemma 4.3. Suppose <f> : 0\ — > O2 is an N-fold cover of O2 by Q\, both of which 
are open subsets o/R n . If the measure f(x)d n x is the symmetric pull-back of the 
measure g(y)d n y, with both f and g positive, then the Jacobian of ip is given by 

Nf(x) 
go<f>(x) 

for any x € 0\ . 

Proof. For every igOi, 

f(x)d n x = r(j?9(y)d n y) = j r [gocj ) }(x)\cf ) '(x)\d n x, 

which proves the lemma. □ 

Proposition ^. 21 allows us to apply this lemma to the current situation. The map 
of 8, fi to the Verblunsky parameters is an n!-fold cover and so we obtain 



(4.7) 
(4.8) 



d(a,4>) 



9(0, fi) 



= 2 i-n |A(e 



n^o(i-Ki 2 )"- fc - 2 

,l-nIl£o(l-hk| 2 ) 



nn 
3=0 »j 

where one should regard the Verblunsky parameters as functions of the Os and 
/is. The formulae above correspond to applying Proposition 14.21 with (3 — 2 and 
P = 0, respectively. Of course, one can also use other values of f3, but the resulting 
formulae are related to one another by Lemma 14. II 

Earlier we promised to determine the (common) normalization coefficient for the 
measures (14. 3f) and 1)4. 4|l . We also promised to give a new derivation of the partition 
function, for the Coulomb gas. We will now settle these obligations. 

Lemma 4.4. The integral of 14.4f> is 

n — 2 

(4.9) J.f na-Ki 2 ) 1 '"-*-"- 1 ^-- 



k=0 



pn-l( n _ 1 y 



while 



(4.10) /••7|A(e*V..,e-")f^...^ ^ + 1) 



2ir 2ir [r(I/3+l)]"' 
in agreement with (|1.5|) and ()1.6(l . 

Proof. Each of the integrals in (|4.9(l is rendered trivial by switching to polar coor- 
dinates: 

\l-\z\ 2 )W- 1 d 2 z = 2nt- 1 . 



It is this integral that gives the normalization coefficient in the definition of the 
distributions; cf. (|1.7|l . 
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The proof of (|4.10|l begins with the evaluation of the Dirichlet integral 

r tt ^ a T ^ )n 

/ 11 d ^--- d ^-x = -7^T, 

JA j=l 1 \ 2 ) 

which is derived in the proof of Lemma lA.41 see (|A.4(1 . As the two measures in 
Proposition 14.21 have the same total mass, the integral of (|4.3|) is given by l|4.9|l . 
Combining this with the Dirichlet integral above leads us to 

' "* 2 1 -«/3™- 1 (n- i)!r(|)« r(f + i)« 

which is exactly (|1.6(l . □ 



5. The Proof of Theorem 2 

As explained in Section[21 Theorem|21is an immediate corollary of Proposition l5.3l 

and the Geronimus relations. As a result, the primary purpose of this section is to 
prove this proposition. 

Throughout this section, <i/x will denote a probability measure of the form given 
in H2.6J) . In particular, it is symmetric with respect to complex conjugation and the 
last Verblunsky parameter, a2n-i, is equal to — 1. We will also use the notation 
Xj = 2 cos(9j ) repeatedly. 

In addition to Lemma 14.11 from the previous section, two further lemmas are 
required. They are the following: 

Lemma 5.1. If Xj = 2cos(#j), 1 < j < n, then 

n 

(5.1) |A(e ±i( \ e ±ie \ e ±w ")\ = \A(x u x 2 , x n )\ 2 J] |2sin(0,)| 

i=i 

where the left-hand side is shorthand for the Vandermonde of the In quantities 

giOi g—iOi e iO n e -i8 n 

Proof. By expanding the Vandermonde as in i|1.2fl . 

|A(e ±lf \e ±lf \.--,e ±ie ")| 



n|e l9i -e^| J] 

1 = 1 j<k 



e i0, _ e ie k \2, e ie j _ e -ie k \2 



= H 1 2 sin(0O| H [2 cos(^) - 2 cos(# fe )] 2 , 
i=i j<k 

as required. In the last step we used that 

\{z — w)(z — w)\ — \(z — w){\ — zw)\ = \(z + z) — (w + w)\ 
for any pair of points z, w on the unit circle. □ 
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Lemma 5.2. Let d/i be a measure on the unit circle of the form given in H'2.6(l and 

let $k denote the corresponding monic orthogonal polynomials. Then 

2n— 1 2n-2 

(5.2) ]J(2 - Xj ) = $ 2n (l) = J] (1 - a fc ) = 2 JJ (1 - ttfc ) 

j fc=0 fe=0 

2n-l 2n-2 

(5.3) n( 2 + = = n c 1 + c- 1 )"^) = 2 n c 1 + 

j fc=0 fc=0 

where Xj — 2cos(9j) and ctk denote the Verblunsky parameters of d/i. 

Proof. As $2n is orthogonal to each of 1, z, . . . , z 2n_1 , which forms a basis for 
L 2 (dfi), its zeros must be e ±lSl , . . . , e ™. Consequently, for any 

*2n(a:) =IJ|x-e w *| a = J|[x 2 -2xcos(^) + l], 

which gives the first equality in each of l|5.2|) and l|5.3l) . 

Because all Verblunsky parameters are real, the coefficients of the orthogonal 
polynomials are also real. This implies Q%{z) = z fc $fe(z _1 ) and so, for z = ±1, the 
recurrence equation l|2.1|l becomes &k+i( z ) — ( z — ctk zk )®k( z )- Each of the second 
equalities stated in the lemma now follows by the obvious induction. The third 
equalities simply express a.i n -\ = — 1. □ 

The following proposition is the SO(2n) analogue of Proposition ^. 21 and so the 
main ingredient of the proof of Theorem [5] 

Proposition 5.3. Consider the following measure on [—2,2]™ x A : 

— \A(xi, x n )f Y[ Hj II t( 2 - ^H 2 + x j) b ] dxi ' ' ' dXn d Vi ■ ■ ■ d Mn-l 
J'=l J=l 

where n — (n— l)£+a+&+l. Under (|2.6[1 and £/ie change of variables Xj — 2cos6j, 
this gives a measure on d/i (which is not normalized) . Transferring this measure to 
the Verbunsky parameters gives 

2n-2 2n-2 

f[ (1 - a^^- 1 Hi 1 ^ k ) a+1 ^(l + (-l) k a k ) b+1 ^ da Q ■ ■ ■ da 2n ^ 2 

k=0 k=0 

and a.i n -\ = 1. After normalization, this measure corresponds to choosing the 
Verbunsky parameters independently with distribution given by 



B ( 2n-k-2 p + a + 1) 2n-fc-2 / g + ft + 1) et , eTlj 
B ( 2n-k-3 p + a + ft + 2, 2 "- fc ~ 1 /3) jfe Odd. 



(5.4) a fc 

t^l — 4 

T/ie definition of B(s,t) is given in <|1.1(J[I . 

Proof. From Lemma 15 . II and Lemma 14.11 we may deduce that 

n 

\A( Xl ,x 3t . . . , x n )\ 2 1] |2sin(0,)| = \A(e ±i9 \ e ±ie *, . . . , e ±l *")| 

(5-5) 

=2» n(i-«2) (aB " fc ~ i)/2 ii/*7 i - 
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By forming the square-root of the product of (|5.2J) and (|5.3|1 , we can rewrite the 
product of |2sin0/| in terms of the Verblunsky parameters: 

n 2n-2 

(5.6) JJ|2sin(0OI =2 JJ (l - a k ) k (l + (-l) k a k )K 

1=1 k=0 

Substituting this into (|5.5|l above and doing a little rearranging of terms leads us 
to 

r 2n-2/ 



n -p-rzn — z 2\ 2 

(5.7) 2^\A( Xl ,x 2 , . . . , x„)| 2 J] H = 2 t 1 ~ r- 

We will return to this equation in a moment. 

For (3 = 2 and a = b = — h , the proposition is an immediate corollary of Propo- 
sitions 13.41 and 13.51 together with (|5.6[1 . The latter arises as the Jacobian of the 
change of variables from 9j to Xj = 2 cos(6j). 

Changing a and b amounts to multiplying this result by the appropriate powers 
of 15.2(1 and l|5.3Jl . respectively. To see that the two measures are equivalent for 
(3 5^ 2, it suffices to multiply by l|5.7|) raised to the & — 1 power. □ 

Combining this proposition with Lemma l4.3l permits us to determine the Jaco- 
bian of the map from the (9, /^-coordinates to the Verblunsky parameters. From 
(3 = 2, a = b = — h we obtain 



(5.8) 



d(a) 



d{Q,n) 



2 1 ~"|AQr 1 ,...,0| ; 

-1-1-211-2/, o\ 2 "- fc - : 

IL=o (!-«*) 2 



We do not have a direct derivation of this fact. 

6. The Selberg and Aomoto Integrals 

In ^3]> Selberg evaluated the following integral: 
pi pi n 
(6.1) j^-j^ \A( Ul ,...,u n )\ 2z J[u°;- 1 (l-u j )v- 1 du 1 ---du n , 

which subsequently turned out to be important in random matrix theory. We 
will present a new derivation of his result in a manner analogous to the proof of 
Lemma 14.41 above. 

We begin with the evaluation of the (common) integral of the measures given in 
Proposition 15. 31 In the case of the second measure, this gives rise to a product of 
beta integrals (cf. (|1.1U|) ') from which we obtain the answer 

jf nr=tll + a + mr=tll + b + H 2 (2»- fe -2)f +tt+ 6 + l 

(6 2) k ° vcn 

y 2 fT 3 r(2^/3 + a + b + 2)T(^1(3) (2 „_ fc _ 2)|+a+b+1 

M T(^^(3 + a + b + 2) 

k odd 

To aid in the eventual comparison with Selberg i we will switch to his parame- 
ters: 

x = a+l, y = b+l, and z = \(3. 
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We also wish to make the following simplifications: The products of the powers 
of 2 can be combined since they are the same for odd and even k, where we can 
easily sum the resulting arithmetic progression in the exponents. Similarly, we 
combine the products of the denominators into a single product and make the 
substitution p — 2n—k — 2. In the even-fc numerators, we will make the substitution 
r = ^(2n — k — 2) and in the odd-k numerators, the substitution s = \{2n — k — 3). 
Combining these we find that (|6.2|1 is equal to 

(6 3) r x KZo r(rz + x)T(rz + y) [Eg T(sz + x + y)T((s + l)z) 

Ilp=o 2 t (p z + x + y) 

where a = \{n — X)z + x + y — l](2n — 1). Note that there is a cancellation between 
the T(sz + x + y) terms in the numerator and the corresponding terms in the 
denominator. 

This essentially completes the determination of the total mass of the measures 
in Proposition 15.31 After the transformation Xj — Aiij — 2, the former of these is 
the tensor product of the measure in (|6.1|l with a measure in the [ij coordinates. 
This leads us to evaluate 

2- K f f-fj 2 , 2- K T(z) n _ 2- K z 1 ~ n r(z + l) n 



(6.4) 2 — J- --J J] n*- 1 dm-" dMn-i = 



n\ T(nz) (n - l)\T(nz + 1) 

where K = (n — l)z + x + y — 1. Moreover, taking the ratio (|6.3() / (|6.4() and making 
the cancellation mentioned above, we obtain 

/• • ■/ |A(x 1; . . . ,x n )\ 2z f[ [(2 - x j f- 1 {2 + xjf- 1 ] dxi ■■■dx n 
(6 5) J = 

= r+K x n"=o ^ + x)T(rz + y) + IJzTjjs + 1» 

r(« + i) a n^i%+» + ») 

Notice that the term z n_1 (n — 1)! from (|6.4|l was split up inside the product over 
s. To simplify, we use £T(£) = T(£ + 1) inside the product over s and notice that 
the final factor, T(nz + 1), just corresponds to s = n — 1. In this way, all products 
run over the same number of terms and can be combined. Therefore, we reach the 
final conclusion 

Vr 1 T(rz + x)T(rz + y)T((r + l)z) 

LHS O = 2 n ' 



r=o T ( z + 1 ) r (( n + r-l)z + x + y) 

where r = a + k = 2n{(n — l)z + x + y — 1]. This is in perfect agreement with 
Selberg's paper: the values of (|6.1|) and l|6.5|l differ by a factor of 2 T as a result of 
the change of variables Xj — iuj — 2; the measure is homogeneous of order r/2. 
We now turn to our second topic: the Aomoto integral. 

In Aomoto determined the average value of \\{x — Xj) when the points Xj 
are distributed according to the Jacobi ensemble, (|1.9fl . Theorem [5] shows that 
this is equivalent to evaluating the average of the characteristic polynomial for a 
certain ensemble of Jacobi matrices. In this way, Proposition 16 . II below reproduces 
Aomoto's result. 
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The answer is given in terms of the classical Jacobi polynomials: In the notation 
of HI, 

(a + b + n+l) n n { 2 X > 
are the monic polynomials that arc orthogonal with respect to the measure 

fd » = 4a+ 4; + +U + i) L m (2 - x)a (2 + x)b dx - 

Here (z) n = T(z + n)/T(z) is the Pochhammer symbol. The recurrence coefficients 
for the corresponding system of orthonormal polynomials are (cf. (12.8(1 ) 

_ 2(b 2 - a 2 ) 

" +1 ~ {2n + a + b)(2n + a + b + 2) 
'' ' o 16(n+l)(n + a + b+l)(n + a + l)(n + b+l) 



(2n + a + b + l)(2n + a + b + 2) 2 (2n + a + b + 3)' 

Proposition 6.1. Let J be the Jacobi matrix whose entries b±,a±, . . . ,b n are dis- 
tributed as in Theorem^ Then 

An | 

(6.7) E[det(.x - J)] = — I — — Pfr h \\x) 

[a + b + n + l) n 

where a = 2 ( a + is> — \ and b = 2 ^ 1 ^ — \ . 

Proof. We will show that both sides of (|6.7|l are related to the same monic orthog- 
onal polynomial on the unit circle. We begin with the left-hand side. 

Let ao, . . . , cx.2n-2 be distributed as in let <i/x denote the corresponding 

measure on S 1 , and let dv denote the measure on [—2, 2] induced by (|B.5|I . 

The characteristic polynomial of J is equal to P n (x), the nth monic orthogonal 
polynomial associated to the measure dv; indeed the principal k x k minor of x — J 
is equal to Pt- (By expanding along the last row, one can see that these minor 
determinants obey the same recurrence relation as the orthogonal polynomials.) 
Combining this with l|2.9|l gives 

(6.8) det((, + O - J) = ^^Y^- 1 ). 

As <£>£ is independent of a k (it depends only on ao, . . . , afe_i), the recurrence 
relations l|2.f[) and l|2.2|l yield 

E$ fc+ i(z) = zE$ fe (z) - Ea k E$* k (z) 
E$* +1 (z) = E$*(z) - zEa k E$ fe (z). 

Hence, E<&fc are the monic orthogonal polynomials associated to the averaged 
Verblunsky parameters: 



26-2a 



E(a fe ) 



(2n-fe-2)/3+2o+2b+4 ^ even i 

/3-2a-2fc-4 ■ dd 

(2n-fc-2)/3+2a+2b+4 * UUU ' 



(Note that if X ~ B(s,t) then E(X) = f=§ .) 

Applying the Geronimus relations to these averaged Verblunsky coefficients does 
not produce the recursion coefficients associated to the Jacobi polynomials. How- 
ever, by Proposition IB. 21 E$2n is also the monic orthogonal polynomial of degree 
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2n associated to the 'reversed' coefficients: <Sfc = Ea!2n-2-fe ; < k < 2n — 2, and 
cbn-i = — 1- Under the Geronimus relations, these coefficients give rise to 

- 2(P - d 2 ) 



(2k + a + b)(2k + a + b + 2) 
_ 2 _ lQ(k+l)(k + a + b+l)(k + a+l)(k + b+l) 
ak+1 ~ (2k + a + b + l)(2k + a + b + 2) 2 (2k + a + b + 3) 

where a = 2 ("+ 1 ) — 1 and b = 2 ^ 1 ^ — 1. By comparison with l|tj.6f) . we find 

iE{z-»$ 2n (z) + ^$ 2n (2- 1 )} - (a+ g:; 1)n 4 a ' B) (^)- 

Equation (|6.7[) now follows from (|6.8|l . □ 

Appendix A. The Surface Measure on S n 

This Appendix presents some elementary results used in the text and is provided 
solely for the reader's convenience. 

Let da denote the usual surface measure on S n and let da denote the corre- 
sponding normalized probability measure on S n . We will write ID n for the n-disk: 
D" = {x £ R" : \x\ < 1}. 

Lemma A.l. Iff : W l+1 -> C, then 

d n x 



[ f(x)da(x)=J2 [ f(x 1 ,...,x n ,±y/T=W) 



Proof. The map <j> : (x,, . . . , x n ) i— > (x,, . . . , x n , y/l — \x\ 2 ) is a diffeomorphism onto 
the open upper hemisphere. The corresponding Gram matrix is 

_ / d(/> d(f) \ _ x t \x\ 2 Xi xj 
" \<).r, 



i — <5j.i t — 
dxj I 1 



which is a rank one perturbation of the identity matrix. Therefore, 
det[G] = 1 



\-\x\ 2 \~\x\ 2 ' 

from which the lemma follows immediately. □ 

We wish to determine the distributions of certain probability measures induced 
from the normalized surface measure on S n . These follow easily from this lemma. 

Corollary A.2. For the projection of the n-sphere into its first coordinate, we have 

(A.l) fix,) da(x) = |* fit) (1 - * 2 )(- 2 )/ 2 dt, 

that is, X\ is B(^, §) -distributed. Projection onto the first two coordinates gives 

(A.2) / fix, + ix 2 ) da(x) = ^fi / f(z) (1 - |z| 2 )(«- 3 )/ 2 d 2 z, 

JS" 277 Jo 

which implies x, + ix 2 is Q n - distributed. 
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Proof. Both formulae follow from the more general statement that for 1 < k < n, 
f( Xl ,...,x k )da(x)<x ( f(x 1 ,...,x k )(l-\x\ 2 ) (n - k - 1 ^ 2 dx 1 ---dx k 



where the proportionality constant depends on k, but not the function /. The value 
of the normalization constant can then be determined by substituting / = 1. 
An inductive proof of the more general statement follows easily from 

2 J2 \(n-k-2)/2 n _fe_i 



f i 2 2 \(n-fc-2)/2 , 

J (s - x k+1 ) dx k+ i oc s 



where one takes s = (1 — x\ — ■ ■ ■ — x 2 ) 1 ^ 2 . □ 

Corollary A. 3. If the vector {x%, j/i, . . . , x n , y n ) is chosen at random from the 
(2n — l)-sphere according to normalized surface measure, then 

(fix, . . . ,/x„) = (xf + y\ , . . . , x\ + yl) 

is uniformly distributed on the (n — l)-simplex Y)fJ,j = 1. That is, 

E/(/xi, . . . ,/i n ) = (n — 1)! / f(si, . ..,s„_i, 1 - si s„_i) dsi ■ ■ ■ ds n -i 

J A 

where A = {(si, . . . , s„-i) : < si H h s n _i < 1}. 

Proof. By Lemma lA.ll we need only compute 

dx\dy\ ■ ■ ■ dx n 



/ f(xi,Vi, ■ ■ .,x n ,±y n ) ■ 



?7n 



1 < j < n - 1 



where y n = y/l - (x 2 + y 2 H \-x 2 n ). 

Let us change variables by 

Xj = cos(^-)V«7 

, n-l ^ 1/2 

x n = cos(cA„) f 1 - s k 
^ fc=l 

for which the Jacobian is y~ 1 dx±dyi ■ ■ ■ dx n — ds\ds2 ■ ■ ■ ds n -id<fii . . . d<fi n . Hence, 
up to proportionality constants that depend only on n, 

E/(/xx, . . . , n n ) oc / f(sx, . . . , s„_r, 1 - s 1 s n _i) dsi • • • ds n _i. 

The determination of the normalization constant is an easy calculation. A more 
general result will appear in the proof of Lemma fA. 41 □ 



Lemma A. 4. For (fi\, . . . , /i„) uniformly distributed on the (n— l)-simplex ^ fij = 
1 and pi, . . . ,p n real numbers greater than —\, we have 

(a.3) * E {n^}= ( "-^ !r(pi+1) - r(p ? +1) . 

Proof. This is sometimes known as Dirichlet's integral; our method of proof is a 
standard one. We begin with the product integral 



n />oo n 



3 = 1"" 3 = 1 
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and change variables to r = and fij = Sj/r. The Jacobian is given by 

r™ _1 dr dp,\ ■ ■ ■ dp n -i — dsi ■ ■ ■ ds n which leads us to 

n /> POO 

r[r(^ + l)=/ / Thrift e- r r n - l drdix x ---dti n -x 
j=1 JaJo 

= T(p! H \-Pn+n) J W^j 3 dflx - ■ ■ dfin^!. 



By taking p\ = ■ ■ ■ = p n = we recover the normalization constant of the 
/ij-integral given in Corollary I A. 31 Moreover, 

E {n $ } = (" - / n $ d ^ ■ ■ ■ ^"-i 

' } = (Ti-i)!r(pi + i)---r(p n + i) 

H h p n + n) 

as promised. □ 

Appendix B. The CMV Matrix and Geronimus Relations 

The purpose of this appendix is to describe some results in the theory of orthog- 
onal polynomials on the unit circle that were used in the main text; we also wish to 
show how easily they can be derived from the perspective of Cantero, Moral, and 
Velazquez, [3]. We begin with an outline of that work. 

Applying the Gram-Schmidt procedure to 1, z, z" 1 , z 2 , z~ 2 , . . . in L 2 (d[i) pro- 
duces the orthonormal basis 



(B.l) Xk(z) = 



z k / 2 4>* k (z) k even 
z^-Q^faiz) k odd 



where k > 0. As in the Introduction, <f>k denotes the ktYi orthonormal polynomial 
and 4>l, its reversal (cf. Q2.3|)l. If we apply the procedure to 1, z^ 1 , z, z~ 2 , z 2 , 
instead, then we obtain a second orthonormal basis: 



(B.2) X k (z)=Xk(l/z) 



z k / 2 4>k{z) k even 

z (-l-k)/*0*( z ) fcodd . 



It is natural to compute the matrix representation of f{z) i— > zf(z) in L 2 (dp) 
with respect to these bases. This was done in a rather cunning way. The matrices 
with entries 

U+xj+i = (xi(z)\zXj(z)) and M l+lj+1 = (X l (z)\xj(z)) 
are block-diagonal; indeed, 

(B.3) L = diag(S ,S 2 ,S 4 , . . .) and M = diag(3_i,Si, S3, . . .) 
where 3_i = [1] and 

„ _ &k Pk 
" k [Pk -o>k_ 

The representation of f(z) 1— > zf(z) in the Xj basis is just LM, which is the CMV 
matrix of the title of this appendix, and in the Xj basis, it is ML. 

If d/i is supported at finitely many points, say to, then a m _i will be unimodular 
and hence S m _i is diagonal. We replace S m _i by the lxl matrix that is its top 
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left entry, a m _i, and discard all Sfc with k > to. In this way, we find L and M are 
naturally m x to block-diagonal matrices. 

In a couple of places we noted that the spectral measure for any U € §©(2n) 
has last Verblunsky parameter equal to —1. The CMV matrix allows us to give a 
particularly short proof of this fact: 

Lemma B.l. Let dfi be a probability measure of the form 

3 

where Zi, . . . , z m are distinct points on the unit circle. If a m —i denotes the final 
Verblunsky parameter associated to this measure, then 

m 

(b.4) Y[ Zj = (-l^-Wi. 

Proof. As one might guess, (|B.4|I just represents two ways of calculating the deter- 
minant of the CMV matrix: the left-hand side is the product of the eigenvalues; 
the right-hand side is the product of the determinants of L and M. Note that 
det(Sfe) = — 1 for < k < to while det(S m _i) = a m —i- D 

Next we prove a result used in the derivation of the Aomoto integral: 

Proposition B.2. Given a finite system of Veblunsky parameters G D, < k < 
rn— 2, and a m _i = e 1 ^ , define a second system by = — e l< ^a m _2-fe; < k < to— 2. 
and a m -i — e 1 ^ . Then the monic orthogonal polynomials of degree to associated to 
these two systems are the same. 

Proof. If L and M are the matrices associated to the as and $ m the monic orthog- 
onal polynomial of degree to, then det(z — LM) = $ m (z). This is because both are 
monic polynomials vanishing at the eigenvalues of LM. 

Similarly, we have det(z — LM) — ( I > m (z) for the as. Hence the proposition will 
follow once we show that LM and LM are conjugate. We will give full details when 
to is even and a few remarks on the changes necessary when to is odd. 

Conjugating LM by 

"0 ... 1" 

... 10 

1 ... 

is equivalent to reversing the order of the rows and columns in each factor. As m 
is even, both L and M retain their structure. The transformation on L amounts 
to replacing a& by —a m ~i-k\ for M, we have the additional change that the (1, 1) 
entry becomes e~^ and the (m, m) entry, 1. 

To convert M to its proper structure (with 1 in the first position), we perform 
the transformations L i— > WLV and M >— > V'MU where 

U = diag(e^, 1, e** 1, . . .), V = diag(l, e^, 1, e*+, ...). 

In this way, we obtain the matrices L and M, which shows that LM is conjugate 
to the original LM. 

When to is odd, reversing the order of the rows and columns converts L to a 
matrix whose structure resembles that of M, while M is converted to an L-like 
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matrix. Proceeding as above shows that LM is conjugate to ML, and hence to 
LM. □ 

Let us now consider the case where the measure d/j, is symmetric with respect 
to complex conjugation, or what is equivalent, where all Verblunsky parameters 
are real. It is a famous observation of Szego (see ^3 §11.5]) that the polynomials 
orthogonal with respect to this measure are intimately related to the polynomials 
orthogonal with respect to the measure dv on [—2, 2] defined by 

(B.5) / f(z + z- 1 )dfi{z)= [ f{x)dv{x). 

JS 1 J-2 

As described in Section[5J the recurrence coefficients for these systems of orthog- 
onal polynomials are related by Geronimus relations: 

bk+i = (1 - a 2 k-i)a 2 k - (1 + ot 2 k-i)a2k-2 
(B.6) 1/2 
afe+i = {(1 - Q2fc-i)(l - oc 2k ){l + a 2 k+i)} 

We will now present a short proof of these formulae. As an off-shoot of our method, 
we also recover relations to the recurrence coefficients for (4 — x 2 ) dv(x) and (2 ± 
x)dv. The former appears in the proposition below, the latter in the remark that 
follows it. 

Proposition B.3. Let ak be the system of real Verblunsky parameters associated 
to a symmetric measure d/i and let L and M denote the matrices of (|B.3|) . Then 
LM + ML is unitarily equivalent to the direct sum of two Jacobi matrices: 



a i 







«i 
b 2 







J 



a i 







«i 

l>2 







where ak and bk are as in ljB~6)l and 

bk+i = (1 - a2k+i)ct2k - (1 + a 2 k+i)a2k-i 
a k+1 = {(1 + a 2 fe+i)(l - al k+2 ){l - a 2 fc+ 3 )} J 
Moreover, the spectral measure for (J, ei) is precisely the dv of (|B.5J| . The spectral 



1/2 



measure for (J, ei) is 



2(1- 



■{A-x l )dv{x). 



_)(l-ai)' 

Proof. Let S denote the following unitary block matrix 



S = diag([l], Si, $,,-■■) where S k 



1 

V2 



-VT~- 



OLk VT+Ofe 



VT~~a k _ 

which is easily seen to diagonalize M. Indeed, S^MS = diag(+l, —1, +1, — 1, . . . ). 
We will denote this matrix by R. 

The matrix LM+ML is unitarily equivalent to A = S ] {LM+ML)S = S^LSR+ 
RS^LS, which we will show is the direct sum of two Jacobi matrices. We begin by 
showing that even-odd and odd-even entries of A vanish, from which it follows that 
A is the direct sum of its even-even and odd-odd submatrices. 

Left multiplication by R changes the sign of the entries in each even-numbered 
row, while right multiplication by R reverses the sign of each even-numbered col- 
umn. In this way, RB + BR has the stated direct sum structure for any matrix B 
and hence, in particular, for B = S^LS. 
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It remains only to calculate the non-zero entries of A. As S and L are both tri- 
diagonal, A must be hepta-diagonal and so the direct sum of tri-diagonal matrices. 
Moreover, A is symmetric (because L is) so there are only four categories of entries 
to calculate: the odd/even diagonals and the odd/even off-diagonals. We begin 
with the diagonals: 



A 



«2fc-2 




O&fc-l VI = a 2k-l 
= (1 - a 2 k-i)a2k - (1 + a2fc-i)a2fc-2 

A 2k , 2k = - = a 2k-l \A + «2fc-l 

= (1 - a2fc-i)a2fe-2 - (1 + a 2 k-i)oi2k 





«2/c 



V 1 + «2/c-l 



\/l "2fc-l 



Q:2fc-2 






Q!2fe 



- yi - a 2 fc-i 

yi + «2fe-l 



Note that the factor of 2 resulting from A being the sum of two terms is cancelled 
by the factors of 2 -1 / 2 coming from S and S^. The calculation of the off-diagonal 
terms proceeds in a similar fashion: 



A 



2fc+l,2fc+3 — 



0' 

P2k 



[yi + a 2 k-i \/l - a 2 k-i 
\J{1- a 2fc -i)(l - + a 2 fe+i) 



yi + Q2fc+r 

yi - a2fc+i 



A 



2k,2k+2 



-yi - «2fe-i yi + «2fe-i 



■y/ (1 + a 2 fc-i)(l - al k )(l - a 2k+1 ). 



' 


0" 




/>2fc 








yn 



«2fe+l 



«2fe+l 



That dv is the spectral measure for (J, ei) is an immediate consequence of the 
spectral theorem, LM+ML = LM + (LM)~ 1 , and the fact that S 1 leaves the vector 
[1,0,..., 0] invariant. 

Tracing back through the definitions, we find that the spectral measure for (J, ei) 
is equal to that for the operator f(z) i— > (z + z^ 1 )f(z) in L 2 {dp) and the vector 

/(*) = (i**)* X2 (*) - (i^i)* Xl (z) = (i^i)^-v^(z) - (i^y^z). 

From the relations (|2.ip. I|2.2II . and (|2.4p. we find pxfi^z) — <fi*(z) — ol\z$i{z), 
Po4>i(z) = z — ao, and po0i( z ) = 1 — c^o 21 - These simplify the formula considerably: 

y — 7 

/(*) 



PoVW 



The expression for the spectral measure for (J, ei) now follows from the simple 



calculation \z 



(z + z- 1 ) 



1Y2 



□ 



Remark: In the above proof, we conjugated LM + ML by the unitary matrix which 
diagonalizes M. One may use the matrix diag(So, S 2 , . . which diagonalizes L, 
instead. This also conjugates LM + ML to the direct sum of two Jacobi matrices. 
In this way we learn that the recurrence coefficients for 2 (i±a ) ^ ^~ x )^ v i x ) are 
given by 

bk+i = ±(1 T a 2 fe)a2fe+i T (1 ± a 2k )a 2k ^i 

1 /2 

Gfc+l = {(IT "2fe)(l - oi 2 2k+1 ){l ± a 2 fe+2)} • 
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